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Abstract 

Let w be a morphic word over a finite alphabet E, and let A be a nonempty subset of S. 
We study the behavior of maximal blocks consisting only of letters from A in w, and prove 
the following: let {ik,jk) denote the starting and ending positions, respectively, of the fc'th 
maximal A-block in w. Then lYmswpf._^^{jk/ik) is algebraic if w is morphic, and rational if w 
is automatic. As a result, we show that the same conclusion holds if {ik,jk) are the starting and 
ending positions of the fc'th maximal zero block, and, more generally, of the fc'th maximal x- 
block, where x is an arbitrary word. This enables us to draw conclusions about the irrationality 
exponent of automatic and morphic numbers. In particular, wc show that the irrationality 
exponent of automatic (resp., morphic) numbers belonging to a certain class that we define is 
rational (resp., algebraic). 
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1 Introduction 



The irrationality exponent /x(^) of an irrational number ^ is the supremum of the real numbers ii 
such that the inequality 

1 

< — 



has infinitely many solutions in rational numbers p/q. 

It follows from the theory of continued fractions that the irrationality exponent of every irra- 
tional number is greater than or equal to 2, and from the Borel-Cantelli lemma that it is precisely 
equal to 2 for almost all real numbers (with respect to Lebesgue measure). However, to determine 
the irrationality exponent of a given real number ^ is often a very difficult problem. Naively, we 
could hope to be able to read it off from the expansion of ^ in some integer base b, but this is 
almost never the case (see |3j for a thorough discussion). 

Let 5 > 2 be an integer. Recently, Bugeaud [4] constructed a class of real numbers whose 
irrationality exponent can be read off from their base-6 expansion. This class includes numbers of 
the form 

for a strictly increasing sequence n = {nj)j>i of positive integers satisfying nj^i/rij > 2, for j > 1. 
To obtain good rational approximations to we simply truncate the above sum. Thus, we set 



It then follows from 



that 



bn.J 



< 



(5"j)»^j+i/"j 



Ai(Cn) ^ limsup Uj^i/rij 



Shallit [D] proved that the continued fraction expansions of any such £„ can be given explicitly, and 
Bugeaud [1| proved its irrationality exponent is given by 



Ai('^n) = limsup rij+i/n 



(1) 



and hence can be read off from its base-6 expansion. This simply means that the best rational 
approximations to .^n are obtained by truncating its base-6 expansion. In this paper, among other 
results, we use this method to study the irrationality exponents of automatic and morphic numbers. 
We let C denote the class of all real numbers £„ as above. 

Pure morphic words are infinite words generated by iterating a morphism defined over a finite 
alphabet. Let S be a finite alphabet, let e denote the empty word, and let /i : S* ^ S* be a 
morphism. If there exists a letter a € S such that h{a) = ax for some x G S"*", and furthermore, 
/i"(a) 7^ e for all n > {h is prolongable on a), then the sequence a,h{a),h'^{a), . . . converges 
as n tends to infinity to the infinite word h'^{a) = axh{x)h'^{x) • • • , which is a fixed point of the 
morphism h. Such infinite fixed points are called pure morphic words. An infinite word is morphic 
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if it is the image under a coding (that is, a letter-to-letter morphism) of a pure morphic word; it is 
automatic if it is morphic, and the underlying pure morphic word can be generated by a uniform 
morphism, that is, a morphism that maps all letters to words of equal length. (Note: the standard 
definition of automatic words, or sequences, uses finite automata. See, e.g., [2], Chapter 5].) A real 
number is automatic (resp., morphic) if its expansion in some integer base 6 > 2 is an automatic 
(resp., morphic) word over the alphabet Sf, = {0, 1, . . . , 6 — 1}. 

Recall that a Liouville number is a real number ^ satisfying /x(^) = oo. Adamczewski and 
Cassaigne |T] established in 2006 that the irrationality exponent of an automatic number is always 
finite, that is, automatic numbers are not Liouville numbers. In [1], Bugeaud used Eq. ([1]) to show 
that any rational number > 2 is the irrationality exponent of some automatic number. These 
two results motivate the following question: 

Problem 1. Determine the set of irrationality exponents of automatic numbers. In particular, is 
the irrationality exponent of an automatic number always rational? 

Unfortunately, we are unable to settle this problem. However, our Theorem [1] below implies 
that the irrationality exponent of any automatic number in the class C is rational. Consequently, to 
give a negative answer to Problem 1 we would need to use a radically different method to construct 
automatic numbers. 

As automatic numbers form a subclass of the morphic numbers. Problem [T] can be naturally 
extended as follows: 

Problem 2. Determine the set of irrationality exponents of morphic numbers. 

Theorem [1] implies that the irrationality exponent of every morphic number in the class C is 
always an algebraic number. Moreover, using the same method as in |4], we are able to show that 
every Perron number > 2 is the irrationality exponent of some morphic number (recall that a 
Perron number is a positive real algebraic integer that is greater in absolute value than all of its 
conjugates). However, Problem [2] remains unsolved. 

Theorem [1] is proved through a combinatorial study of maximal A-blocks in automatic and 
morphic words, where A C S is a subalphabet. We find the combinatorial results interesting in 
their own right. 

Our paper is organized as follows. In Section [2] we give some definitions and state the main 
theorems, as well as some open problems. In Section [3] we analyze the structure of A-blocks in 
pure morphic words. In Section |4] we apply the results of Section [3] to morphic words in general. 
In Section [5] we construct, for a given Perron number /x > 2, a morphic number £ C such that 

m(0 = 

2 Exponents of Diophantine approximation and maximal blocks 

To carefully investigate the question whether one can read off the irrationality exponent of a real 
number from its expansion in some integer base, Amou and Bugeaud [3j introduced new exponents 
of Diophantine approximation. Throughout the present paper, || • || denotes the distance to the 
nearest integer and [-J denotes the greatest integer function. 

Definition 1. Let ^ be an irrational real number. Let b be an integer with b > 2. We let Vh{^) 
denote the supremum of the real numbers v for which the equation 

m\\<{bn"' 
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has infinitely many solutions in positive integers n. We let ^{,(0 denote the supremum of the real 
numbers v for which the equation 

Wib' -l)^\\ < (5"+^)-'' 
has infinitely many solutions in positive integers r and s. 

The exponent Vb measures the accuracy with which a real number is approximable by rationals 
obtained by truncating its base-6 expansion, while v'^ measures the accuracy with which a real 
number is approximable by rationals obtained by truncating its base-6 expansion and completing 
by periodicity. 

For every irrational number ^, we have > "^6(0 ^ for b > 2, and 

> 1 + max{w^(e), 1} > 1 + max{vbiO, 1}, for b > 2. 
Furthermore, any real number 

i>i 

belonging to the class C satisfies 

Ai(0 = 1 + VbiO = 1 + v'biO = limsup nj+i/uj. (2) 

To understand the shift by 1, just observe that /x(^) — 1 is the supremum of the real numbers jj. 
such that 

\m\<q-^ 

has infinitely many solutions in positive integers q. 

The main result of the present paper is the following: 

Theorem 1. Let ^ be an irrational real number, and suppose the expansion of ^ in some integer 
base b > 2 is an automatic (resp., morphic) word over the alphabet {0,1,..., 6— 1}. Then the 
number Vb{C) is finite and rational (resp., algebraic). 

Conversely, we do not know whether for every positive algebraic number v there exist b > 2 
and a morphic number ^ such that Vb{^) = v. The next theorem provides a partial result towards 
the resolution of this problem. 

Theorem 2. For every rational number v > 1 (resp.. Perron number v > 1) and every integer 
b > 2 there exists a real number ^, such that the base-b expansion of is an automatic (resp., 
morphic) word over the alphabet {0, 1, . . . , 6 — 1} and Vb{i) = v — 1. 

Theorem [2] asserts that the set of values taken by the exponent Vb at automatic irrational real 
numbers is precisely the set of nonnegative rational numbers. 

The real numbers ^ constructed in the proof of Theorem [2] satisfy /x(^) = 1 + Vb{C) when 
Vb{0 — 1> but we do not know their irrationality exponent if Vb{(,) is less than 1. 

We use the same method as in [4J to show the following: 

Corollary 3. For every Perron number /x > 2 there exists a morphic number £, such that /z(^) = /j,. 
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We are currently unable to determine the set of positive algebraic numbers v for which there 
exist b > 2 and a morphic number ^ such that Vf,{C) = v. But this set strictly contains the union 
of the positive rational numbers and the numbers of the form r — 1 with r a Perron number. 

Theorem [T] and Corollary [3] can be phrased in combinatorial terms. Let w = woWiW2 ■ ■ ■ be an 
infinite word over S;, = {0, 1, . . . ,b — 1}, and let < i < j. We say that Wi ■ ■ ■ Wj is a maximal zero 
block in w if ifj = WiJ^i = ■ ■ ■ = Wj = 0, Wj+i ^ 0, and either i = or Wi-i ^ 0. Theorem [T] and 
Corollary [3] can be phrased in terms of the maximal zero blocks in the base-6 expansion of 

Theorem 4. Let w = wqWiW2 ■ ■ ■ be an automatic (resp., morphic) word over {0, 1, . . . , 6 — 1}, 
that does not have a suffix of the form O"^. For k > 0, let {ik,jk) denote the starting and ending 
positions, respectively, of the k'th maximal zero block in w. Then limsupi.^^ jk/ik is finite and 
rational (resp. algebraic). 

Strictly speaking, Theorem[T]is not a restatement of Theorem^ since, to deal with the exponent 
Vfj, we also have to control the occurrences of blocks composed only of the digit 6—1. 

Theorem 5. For every Perron number /i > 2 there exist a binary morphic word w = woWiW2 ■ ■ ■ , 
such that the sequence of indices {nj}j>o = {n '■ Wn = 1} satisfies 

1. Uj^i/rij > 2 for all j > 0; 

2. limsupj_^o^ rij+i/nj = fi. 

Maximal zero blocks are a special case of maximal x-blocks, where x S is an arbitrary 
word. We say that y = Wi ■■■ Wj in w is an x-block if there exist some proper suffix x' and 
proper prefix x" of x, such that y = x'x'^x" for some integer n > 1; if x"wi+i is not a prefix of 
X, and either i = or Wi^ix' is not a suffix of x, then the x-block is maximal. For example, 
let w = 0100111010101000- • • e {0,1}"^, and let x = 01. Then woWiW2 = 010, W3W4 = 01, and 
wq - ■ ■ wi3 = 10101010 are all maximal x-blocks. Theorem [4] can be generalized as follows: 

Theorem 6. Let w = W0W1W2 ■ ■ ■ be an automatic (resp., morphic) word over {0, 1, . . . ,b — 1}, 

and let x G S^. Assume w does not have a suffix of the form x'^. For k > 0, let {ik,jk) denote the 
starting and ending positions, respectively, of the k'th maximal x-block in w. Thenlimsupj.^^jk/ik 
is finite and rational (resp., algebraic). 

Theorems H] and [6] are proved in Section HI Theorems [5] and [2] and Corollary [3] are proved in 
Section [5l 

In light of Theorem [6l it seems plausible that the method used in the proof would allow us to 
say something about the exponent v'f^. This is not the case, however, since we then have to consider 
possible repetitions of every word x G E^, and hence, to take the supremum of an infinite set of 
rational (resp., algebraic) numbers, that may not be all distinct. We cannot guarantee that this 
supremum is rational (resp., algebraic), nor even that it is finite. Thus, we are unfortunately unable 
to establish the following statement: 

Unproven Assertion (i). Let ^ be an irrational real number, and suppose the expansion of 
^ in some integer base b > 2 is an automatic (resp., morphic) word over the alphabet = 
{0, 1, . . . , 6 — 1}. Then the number i'{,(^) is finite and rational (resp., algebraic). 
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If we could prove that for every irrational real number ^, such that the expansion of ^ in some 
integer base 6 > 2 is morphic and has sublinear complexity, the number is finite, then we could 
extend Theorem 2.1 of Adamczewski and Cassaigne [Ij, asserting that the irrationality exponent 
of an automatic number is always finite, as follows: 

Unproven Assertion (ii). A morphic number of sublinear complexity cannot be a Liouville num- 
ber. 

Lemma 5.1 of [1] states that v'^{^) is finite for every irrational automatic number, and is a key 
step in the proof of Theorem 2.1 of jl]. Here, the assumption that ^ is automatic is crucial. The 
other steps of the proof do not require such a strong condition on ^ and can be easily adapted to 
the case where ^ is morphic with sublinear complexity. 

However, for numbers in the class C, Theorem [T] and Equality ([2]) imply the following corollary: 

Corollary 7. Let ^ be an automatic (resp., morphic) number in the class C. Then /x(^) is finite 
and rational (resp., algebraic). 

Corollary [7] is a small step towards the resolution of Problem [TJ 

3 A-blocks in pure morphic words 

Definition 2. Let w = woWiW2 • • • € T,^ ■ Let Sub(w) denote the set of finite subwords of w. An 
occurrence of w is a triple (n, i,j), where e / n G Sub(w) and < i < j, such that Wi - ■ -wj = u. 
We usually denote an occurrence {u,i,j) simply by u. The set of all occurrences of w is denoted 
by Occ(w). An occurrence {u,i,j) € Occ(w) contains an occurrence {u',i',j') G Occ(w), denoted 
u' ~< u, if i < i' and j > j'. 

Definition 3. Let w = W()WiW2 • • • S S'^. Let ACS, and let A = S \ A. An occurrence 
{u,i,j) E Occ(w) is a A-block u € A+. A A-block {u,i,j) G Occ(w) is maximal if tfj+i € A, 
and either i = or Wi-i € A. 

Our goal in this section is to prove the following theorem: 

Theorem 8. Let /i : S* ^ S* 6e a nonerasing morphism, and let w = woWiW2--- = h^{wo). 
Let A C. Ti be a nonempty subalphabet, such that w contain infinitely many letters of A and A- 
blocks of unbounded length. For k = 0,1,2, ... , let {ik,jk) denote the starting and ending positions, 
respectively, of the k'th maximal A-block in w. Then limsupj^^^ jk/ik is an algebraic number of 
degree at most If h is also uniform, then limsupf._,^ jk/ik is rational. 

We require w to contain A-blocks of unbounded length because otherwise limsupf.^^jk/ik 
is trivially rational. This condition implies in particular that w is aperiodic, that is, it is not 
ultimately periodic. Here an ultimately periodic word is a word of the form w = xy^ , where a; G S* 
and y E 

Proving Theorem [8] will enable us to prove the algebraicity (resp., rationality) of the sequence 
of zero blocks in morphic (resp., automatic) words in general: if w = T{h^{a)), where r is a 
coding, then a maximal zero block in w is the image under r of a maximal A-block in (a) , where 
A = T^^(O). The case of maximal x-blocks will be proved by applying a morphic-preserving (resp., 
automaticity-preserving) transformation to w. 
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The technique we use to prove Theorem [8] is very similar to the technique used to prove the 
algebraicity of critical exponents in pure morphic words [6l[7j. The idea is as follows: 

1. The sequence of maximal A-blocks can be partitioned into subsequences, where for each 
subsequence, every element is an image under h of the previous element, up to a small change 
at the edges. 

2. There are only finitely many different such subsequences in w. Since we are interested in 
limsup, it is enough to consider only the first of each of the different subsequences. 

3. The limsup of a subsequence can be computed using the incidence matrix of h (see Defini- 
tion H]). In particular, the limsup is a rational expression of the eigenvalues of the said matrix, 
which are algebraic numbers of degree at most 

4. When h is uniform, the expression turns out to be rational. 

Definition 4. Let S = Il„ = {0, 1, . . . , n — 1}, let /i : S* ^ S* , and let u G S* . The Parikh vector 
of u, denoted by [u], is a vector of size n that counts how many times different letters occur in u: 
[u] = (|u|o, |ii|i, . . . , The incidence matrix associated with h, denoted by A(h), is an n x n 

matrix, whose j'th column is the Parikh vector of h{j): 

A{h) = {aij)o<i,j<n ; aij = \h{j)\i . 

Proposition 9. Let h : T,* ^ T,* , and let A = A{h). Then: 

1. [h{u)] = A[u] for all u G S*; 

2. A{¥') = A" for all n G N. 
See, e.g., [2l Section 8.2]. 

Notation: for a word w (finite or not), alph(zi;) denotes the set of letters occurring in w. 

Lemma 10. Let /i : S* — > S*. Then there exists some power g of h such that for all a T, and 
for all n > 1, sdph{g'^ (a)) = alph{g{a)) . 

Proof Let A = A{h) = (aij), and denote A"- = A(/i") = (^J). Then for ah letters a,b e T. and 

for all n > 1, 6 G alph(/i"'(a)) if and only if \h'^{a)\b > 0, that is, if and only if > 0. Since we 
care only about the zero pattern of A^ and not about the value of the non-zero entries, it is enough 
to consider ^4 as a boolean matrix. Let S be a |S| x |S| boolean matrix, such that bij = if and 
only if Oj.j = 0. Then it is enough to prove the following: there exists some power B' of B, such 
that S'" = B' for all n > 1. 

Since there are only finitely many boolean matrices of a given size, there exist some integers 
t > and c > 1 such that 5* = 5*+^ and so B^+^ = 5*+*^+"^ for all /c G {0, 1, . . . , c - 1} and for 
ah ra > 0. Choose a A; G {0, 1, . . . , c - 1} such that c\t + k, and let B' = B^^^ . Then t + k = mcioT 
some integer m, and for all n > 1, 

By setting g = h^^^ we get the desired morphism. □ 
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Let w = h^(a) be a pure morphic word over S. Then w = {h*)^{a) for all t > 1, and so we can 
replace h by some convenient power. Therefore, by Lemma [TOl we can assume the following: 

Assumption 1. For all a € 5] and for all n > 1, alph(/i"(a)) = alph(/i(a)). 

In addition, for the rest of this section we assume that h is nonerasing. 

Definition 5. Let /i : S* — > S*, and let w = wqWiW2 ■ ■ ■ = h^{wQ). The inverse image under h of 
an occurrence u E Occ(w), denoted /i^^(n), is the shortest occurrence v € Occ(w) such that h{v) 
contains u. 

Note that for an occurrence u (rather than a subword u), the inverse image is well defined. 
Whenever we use the notation h~^{u) it should be understood that u is an occurrence. 

In the next two lemmas, we want to establish the following idea: if a pure morphic word w = 
h'^(wo) contains A-blocks of unbounded length and infinitely many letters of A, then sufficiently 
long A-blocks are images under h of other A-blocks, except perhaps for edges of a bounded length. 

Lemma 11. Let w = woWiW2 ■ ■ ■ = h^{wQ) be an aperiodic pure morphic word over an alphabet 
T,, where h is nonerasing and satisfies AssumptionUl and let M = max{|/i(a)| : a G S}. Let 
A CTi be a nonempty subalphabet, such that w contains infinitely many letters of A. and A-blocks 
of unbounded length. Let u = Wr ■ ■ ■ Ws G Occ(w) be a maximal A-block, such that \u\ > and 
r > M, and let h^^{u) = Wi ■ ■ ■ Wj. Then 

1. WiJ^M ■ ■ ■ Wj-M is a (not necessarily maximal) A-block; 

2. Wi-M+i • • • ^i+M-i contains a letter of A; 

3. Wj-M+i ■ ■ ■ Wj^M-i contains a letter of A. 

Proof. Suppose there is a letter a occurring in u such that b := h~^{a) G A. Let c = h~^{b) (since 
a occurs at a position k > M > \h{wo)\, h~^{b) is well-defined). Then by Assumption [H /i^(c) 
contains 6, and so /i^(c) = h^{h~'^{a)) is not contained in u. This implies that b (which is contained 
in h{c)) occurs at a distance of at most M from the edges of h~^{u). Therefore, Wi+M • • • Wj-M is 
a A-block. 

Now consider Wi-M+i ■ ■ ■ 'U^«+M-i- Since u is maximal, d := Wr-i G A. Let e = h^'^{d) (again, 
h^'^{d) is well-defined, since r — 1 > M). Then h{e) contains d by Assumption [TJ But h{e) 
is contained in Wi-M+i • • -Wi+M-ii and so Wi-M+i • ■ "U^i+Af-i contains a letter of A. Similarly, 
Wj^Ai+i ■ ■ ■ Wj-f-M-i contains a letter of A. □ 

Lemma 12. Under the conditions of Lemma Mli let u = Wi ■ ■ ■ Wj G Occ(w) be a maximal A-block, 
such that \u\ > and i > M. Then 

1. h{wi+M • • • Wj-M) is a A-block; 

2. h{wi-M+i ■ ■ ■ Wi+M-i) contains a letter of A; 

3. h{wj-M+i • • ■Wjj^M-i) contains a letter of A. 

Proof. Suppose h{u) contains a letter 6 G A. Then there exists a letter a G A such that h{a) 
contains b. By the same argument as in the proof of Lemma [TTl h{h~^{a)) contains b, and cannot 
be contained in u. We get that a occurs at a distance of at most M from the edges of u, and so 
h{wi+M ■ ■ ■ Wj-M) is a A-block. The rest is proved similarly. □ 
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Corollary 13. Under the conditions of Lemma [Tl[ the set of maximal A-blocks u = Wi ■ ■ ■ wj that 
satisfy i > M and \u\ > can be partitioned into (infinitely many) sequences, each of which has 
the form u^^\ u^-^\ u^'^\ ■ ■ ■ , where for all k > 0: 

1. u^^^ is a maximal A-block; 

2. ifu'^'^) = Wi - --Wj, then h{wi+M ■ ■ ■ Wj-u) -< n(''+^) -< h{wi-M+i ■ ■ ■ wj+m-i)- 

Definition 6. Let w = W()WiW2 ■ ■ ■ = h^{wo) be an aperiodic pure morphic word over an alphabet 
S, let M = max{|/i(a)| : a e S}, and let A C S. A A-sequence in w is a sequence u^'^^ = Wi^, ■ ■ ■ Wj^ 
of maximal A-blocks, where for all A; > 

• ik> M- 

• > M2; and 

• h{wik+M ■ ■ ■ wj^^m) ^ n(*=+i) -< h{wi^^M+i ■ • • Wj^^+M-i)- 

Definition 7. Let w = wqWiW2 ■ ■ ■ = h^{wQ) and let {u^''^}k>o be a A-sequence. For k > 0, let 
= Wr,+^ ■ ■ ■ Ws^^, and h{wj^) = Wm,+, ■ ■ ■ Wn^^,. Then 

• u^^'^^'^ is growing on the left if ik+i < '"fc+i; 

• nC^+i) is shrinking on the left if i^+i > r^+i; 

• u^^~^^^ is stationary on the left if ik+i = r^+i- 

Similarly, u^^~^^^ is growing on the right if jk+i > n^^i, shrinking on the right if jk+i < n-k+i, 
and stationary on the right if jk+i = "rafc+i- The left stretch of u^''^^\ denoted by a^^~^^\ is the 
word that occurs between the left edge of u^^~^^^ and the left edge of h{v!^^'>). That is, if u'^^^^^ is 
shrinking on the left, then a^^'^^'^ := li'rfc+i ■ ■ ■ ^ife+i-i (iii this case we say that the left stretch is 
negative); if u^^'^^^ is growing on the left, then cr'^^^^^ ■= Wii,_^^ ■ ■ ■ 10^^+1-1 (in this case we say that 
the left stretch is positive). Note that if positive then it is contained in ' , and if it 

is negative then it borders tt('^+-^) on the left. If r^+i = ik+i then fj^'^"'"^) := e. The right stretch, 
denoted by p^^\ is defined similarly. 

(k) 

The fe'th left pivot, denoted hy p)^ , is the rightmost letter in Wii^-M+i ■ ■ ■ ?Wjfc+M-i such that 
h{p^L^) contains a letter of A; that is, p^l^^ := h^^{vui^^-^-i). The right pivot, denoted by p^^\ is 
defined similarly. 

Figure [T] illustrates Definition [71 

Lemma 14. Let w = W0W1W2 ■ ■ ■ = ^^{wq), where h is nonerasing and satisfies Assumption\^ and 
let {n*^'^)}fc>o be a A-sequence. Then the sequences {cy^^^}k>i! {p^^^}k>i ultimately periodic. 

Proof. We prove the lemma for the left stretch. The proof for the right stretch is similar. 

For A; > 0, let p^''^ = Wp,^. Then /i(wpo+i ' ' ' '"^jo+Af-i) ^ A+, and so h'^iwp^^+i ■ ■ ■ Wi^+M-i) G 
A+ for all n > (recall Assumption [1|) . In particular, h'^{wpg+i ■ ■ -vuig^M-i) G A"*", and so p^^^ 
cannot occur in h{wpg+i ■ ■ ■ Wiq+m^i). On the other hand, alph(/i^(wp„)) = alph.{h{wp^)) , and so 
h{wpg) contains a letter a such that h{a) contains a letter of A. In particular, p^^ -< h{p^^^). More 
generally, for all A; > 0, is the rightmost letter a in h{p^^ ^^) such that h{a) contains a letter of 
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A. This implies that the sequence of left pivots, {p\^ }fc>0) is ultimately periodic: since S is finite, 
there exist some k ^ m such that p^^^ = p^[^\ and so p^^'^"'^ = p^^"-* for all n > 1. 

Now consider the left stretch. Let h[p^^) = Wxi ■ ■ -Wyi- By definition, either ii = yi + \ (if the 
rightmost A letter is the last letter of h{p^p)), or xi < ii < yi. In the first case, Wi-^ -< h{wp^^j^i), 
and so h{wi^) € A+, and u^"^^ is either growing or stationary. Since h{p^^^) contains p^^ (that is, 

h'^iPi^) contains a letter of A), we get that cr'^^-' is a sufhx of h?{p^^^)- 

If xi < i\ < yi, then u^^^ can be also shrinking. However, in this case both h{wi^) and Wi^ are 
contained in /i^(p^°^), and so cr^^) ^ h?(p^^^). Similarly, cr^*') -< h{p^^~^^) for all k >2. This implies 
that {(T^''^}k>i is ultimately periodic. □ 

Let {u^^^}k>Q be a A-sequence. Since both {(J^''''}k>i and {p^'^^}k>i arc ultimately periodic 
sequences, the sequence {{o'^^\ p^^^)}k>i is also ultimately periodic. By ignoring the first few 
elements of the sequence we can assume it is purely periodic; by replacing h by /i^, where p is the 
period, we can partition {{o-^''\ p^^^)}k>i into p subsequences, where each subsequence has period 
1. We now compute ik and jk for a sequence of maximal A-blocks, assuming that cr^*^^ and p^'^^ are 
fixed. 

Lemma 15. Let {u^'^^}k>o be a A-sequence, and assume that a^^^ = a and p^^^ = p for all k > 0. 

Let A be the incidence matrix of h, and let 1 be the all ones vector of size 1 x Then there 

exist integral vectors U,V,X,Y of size |S| x 1, where U and V are nonnegative and nonzero, and 
a constant c, such that for all k >0, 

-jk = ik + l {a'^U + (E^li A'^)y) - 1; 
• jk/ik < c. 
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Proof. Let v'^^^ = wq ■ ■ ■ Wj^-i. Then ik = [v*^*^^! = and jk = ik + \u^^'^ \ — l = ik + llu^^^ — 1. 

To compute ik and jk we need to compute [u^'^^] and [v^'^^]. 

Let U = [u^^^] and V = [v^^^]. Since io > v^^^ is a nonempty word, and so both U and y are 
nonnegative, nonzero vectors. Let us assume for the moment that p = e. Depending on whether a 
is positive or negative, there are two possible situations: 

1. h{u^^^) = au'^^^^^ (fj is negative); 

2. ah{u'^^'>) = u^^^^^ {a is positive). 

Suppose a is negative. Then h{u^^^) = au^^\ h?'{u'^^) = h{au^^^) = h{a)h{u^^^) = h{a)au^'^\ and 
by induction, h^{u^^^) = h^~^{a)h''^'^{a) ■ ■ ■ h{a)au^^\ By Proposition [9l we get that for all A; > 0, 

[u^^)] = [/,'=(^(0))] - [h'-\a)] - [h'-\<y)] [a]=A'U- {y.A M- 

\£=o / 

Now suppose that a is positive. Then u^^^ = ah{u^^^), u^"^^ = oh{u^^^) = ah{(7)h?{u^^^), and by 
induction, u'^^'> = ah{o) ■ ■ ■ h^~^{a)h''{u^^^). By Proposition [9l we get that for all A; > 0, 

\e=o J 

If P^e, then, depending on its sign, we get that [n^^)] = A'^U ± (Y^iZq ^^)H ± (Y^iZq ^'^)[p]- 
Let Y = ±[a]± [p]. Then for all k>0, 



uW] = A'^U ■ 



Ke=o / 



Similarly, [v^''^] = A^V + (Eto if ^ is negative, and [v^^^] = A^V - (E^^=o if 

is positive (here the roles are inverted: if a is negative then it is positive with respect to v^^\ and 
vice versa). Let X = T[f]- Then for all A; > 0, 



V 



(fc)l 



)] = A^V+\Y,A'\X. 



It remains to show that the sequence jk/ik is bounded by a constant. Let w'^^'> = uij^+i • • • Wi^^-^-i. 
Since maximal A-blocks are disjoint and separated by at least one letter from A, \w^^'^ \ > 1 for all 
k>0. Now, for all k > 1, 

= |/i('u('=-i))| ± |o-| ± |/9[ < Mlu^'^-i)] + 2M , 
\yik)\ = |^(fc-i)^(fc-i)y^(fc-i)| > , 

and so 

jk lu^*^)!-! . _ 2M 



Ik if^'^)! 

This completes the proof of the lemma. □ 
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The following theorem was proved in [6', T] : 

Theorem 16. Let A be an nxn nonnegative integral matrix with no zero columns, and let U, V, W 
be nonnegative integral column vectors of size n, with W ^ 0. Let 

l(/l*f/+(Et„'-4-)V 



^'^= UAHV) 

Then 

1. {^{k)}k>o has finitely many accumulation points; 

2. if a is a finite accumulation point of T , then a is a rational expression of the eigenvalues of 
A. In particular, a is algebraic of degree at most n. 

The proof of Theorem [16] can be adapted, with slight changes, to the case of the sequence 
{jk/'i'k}k>o- Here we have a sequence of the form 

j, |n(^)l-l ^ (a'U + (Eto A')y) - I 

\v^^^ I 1 i^AW + E™-^ A^)X^ 

where A is a |S| x |S| nonnegative integral matrix with no zero columns (recall that h is nonerasing), 
U and V are nonnegative integral vectors, both nonzero, and X and Y are integral vectors, with 
possibly negative entries. However, since both \u^^^\ and \v^^^\ are tending to infinity as k tends 
to infinity, both nominator and denominator are always positive. In particular, the fact that X 
and Y may contain negative entries does not alter the result. Also, Lemma [15] implies that all 
accumulation points are finite. 

Corollary 17. Under the conditions of Lemma \15\ limsup^j^ooifc/^fc an algebraic number of 
degree at most 

Lemma 18. Under the conditions of Lemma \15l if h is uniform then liiuk^oo jk/ik exists and is 
rational. 

Proof. If /i is an m-uniform morphism, then = m\w\ for all w € S*. Let u = lu*-^-*!, v = \v^'^^\, 

y = ±|cr| ± \p\, and x = ^\(7\. Then the expressions for lu^'^^l and \v^^^\ are reduced to 

k—l ^ 



\u^ '\ = m |n| + ( > m )y = m \u\+y- 



k-l 



— 1 



m 



Therefore, 

Jk 



_ , , k^l-i m^H + y^^r-i 7 > ^^(m-i)H + y 

— = H r-TTT, — = H ; — k ^ oo 1 + — 

Zk rn^'lvl+x^^ (m - l)\v\ + x 



□ 
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Proof of TheoremlEi Let M = max{|/i(a)| : a £ E}. Since we are interested in limsupjfc/ifc, 
it is enough to consider only A-blocks of size larger than Af^ that occur at an index i > M. 
By Corollary [T3l these A-blocks can be partitioned into sequences, where for each sequence, an 
element is the image under h of the previous element, save perhaps for edges of a bounded length. 
Let u = Wi ■ ■ ■ Wj be the first element of such a sequence. Then |m| > and i > M, and so by 
Lemma [TT| v := h~^{u) is a maximal A-block (up to the edges); however, |f | < M^, or it would be 
part of the sequence itself. Taking into account the occurrences of size M on both sides of v, we get 
that each sequence is uniquely determined by a subword of w of length at most + 2M. Since 
there are only finitely many such subwords, there are only finitely many different such sequences. To 
compute the limsup, it is enough to consider only the first of each of the different sequences, where 
ik is the smallest. Therefore, we need to consider only finitely many sequences. Each sequence 
can be further partitioned into finitely many subsequences, where for each of those, limsup jk/ik 
is algebraic of degree at most |S| (Lemma llSp . For uniform morphisms, limsup jk/ik is rational 
(Lemma HH]). □ 

4 A-blocks and x-blocks in morphic words 

In this section we extend Theorem [8] to morphic words in general, as described in the beginning 
of Section [3l First, the next theorem shows that we lose no generality by restricting ourself to 
nonerasing morphisms: 

Theorem 19 ([21 Theorem 7.5.1]). Every pure morphic word is the the image under a coding of a 
pure morphic word generated by a nonerasing morphism. 

Theorem 20. Let w be a morphic word over a finite alphabet S. Let A CT, be a nonempty proper 
subalphabet, such that w contain infinitely many letters of A and A-blocks of unbounded length. 
For k = 0,1,2, .. ., let u^'^^ = Wi^ ■ ■ ■ Wj^ be the k 'th maximal A-block in w. Then limsup^j^o^ jk/ik 
is algebraic. If w is also automatic then limsupi^_,^ jk/ik is rational. 

Proof. Since w is morphic, there exists some alphabet S', a morphism h : T,'* — > S'*, and a coding 
r : S'* Ti*, such that w = T{h^[a)) for some a G S'. By Theorem I19| we can assume that h 
is nonerasing. Let A' = r^^(A). Then every maximal A-block in w is the image under r of a 
maximal A'-block in h^{a), and every maximal A'-block in h^{a) is mapped by r to a maximal 
A-block in w. The result follows from Theorem [8l □ 



Proof of Theorem\^ Let = d, and let X = {{u^^\ik-, ik) ^ Occ{v^) : /c > 0}. First, we 
partition X into d subsequences, Xq, . . . Xm-i, where 

Xm = {{u, r,s) £ X : u = x'x"'x" , x' is a proper prefix of x, and r -|- |x'| = m (mod d)}. 

That is, Xi is the sequence of maximal x-block for which x itself begins at an index equivalent to 
m (mod d). For m = 0,1, . . . ,d — 1 and for A; = 0, 1, 2, . . ., let {ik,rm jk,m) denote the starting and 
ending positions of the fc'th element of Xm- Then 



Proof of Theorem\^ Set A = {0} and apply Theorem [20l 



□ 




(3) 
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Let T,' = {[uq ■ ■ ■ ad-i] '■ clq - ■ ■ a^-i G Sub(w)}, and define d infinite words over S' by 



Wo = [wq--- Wd-i] [wd--- W2d-i] ■■■ , 

Wi = [Wl--- Wd] [Wd+1 ■ ■ ■ W2d] ■■■ , 
Wd-l = [Wd-l ■ ■ ■ W2d-2] [W2d-1 ■ ■ ■ W3d-2] • ' ' ■ 

By [2^, Theorem 7.9.1], if w is morphic then is morphic for all m; by [5], if w is automatic 
then is automatic for all m. Let L = S U {a}, where a ^ E, and define a d-uniform morphism 
t:T,'* ^ r* by 

a'^, if ao • • • ad-i = x; 

ao--- a-d-i, if ao • • • ad-i / x. 



T([ao • • • ad-i]) 



Let Vm = T(wm), m = 0, ... ,d — 1. By [2, Corollary 7.7.5, Corollary 6.8.3], if is morphic 
(resp., automatic), then so is Vj72. Let A — {c^}; and let {j-k^mi '^k,m) denote the starting and 
ending positions of the /c'th maximal A-block in v^T^. TliGii for all ^ ^ 0, |'?^/c,m — ^/L,m| ^ d 
and \sk,m - jfc,m| < d, and so \\Ta.s\xpk^^Sk,m/rk,m = limsup^^,^ ifc^^/ifc^^. By Theorem [201 
\im.sn\)i.^^ Sk,m/T^k,m is algebraic (resp., rational) if is morphic (resp., automatic), and so 
limsup^j^o^ jfc_m/^fc,m is algebraic (resp., rational) if w is morphic (resp., automatic). By ([3]), the 
result follows. □ 



5 Perron numbers as irrationality exponents of morphic numbers 



Proof of Theorem [5l Let /i > 1 be a Perron number. Then there exists a primitive integral 
square matrix A, of size k x k for some positive integer k, such that r{A) = n, where r{A) is the 
Perron-Frobenius eigenvalue of A |8, Theorem 11.1.4]. We may assume that k > 2: if ^ is not 
integral then necessarily k > 2, and if fi is integral we can set A to be the 2 x 2 matrix (^^i;^ ''^ ^). 
Let S = Sfc = {0, 1, . . . ,k — 1}, and let /i : S* ^ S* be a morphism such that A{h) = A. Then 

\h^iO)\ _ lA-[0] 



|/i"-i(0)| l^"-i[0] 

Since /u is a Perron number, the Jordan decomposition of A has one block of size 1 associated with 
/i, and it is easy to check that 

\hHO)\ 
iim ■- , , , , = u. 

n^oo |/l"-l(0)| 

For /i > 2, the above equation implies that |/i"(0)|/|/i"^^ (0)| > 2 for n sufficiently large; for /U = 2, 
we let A = ([ I), and get that |/i"(0)|/|/i"'^(0)| = 2 for ah n. Note that we do not require h to be 
prolongable on 0; we consider only the finite words {/i"'(0)}„>o- 

Let r = {a, /?} U S. Define a morphism 5 : F* — > F* by g{a) = a/30, g{(3) = f3, and g{i) = h{i) 
for i G S. Let u = g'^{a). It is an easy induction to show that 

u = a-/3-0-/3-/i(0) •/?-/i2(0) •/?■•• . 
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Now let r : r* — > {0, 1}* be the coding that maps /? to 1 and all other letters to 0, and let w = t(u). 
Then 

w = • 1 • 0^0 • 1 • • 1 • 0^2 . 1 . . . ^ 

where Xn = |/i"(0)|. Thus, w is a morphic sequence that satisfies the conditions of Theorem O 
This completes the proof of the theorem. □ 

Proof of Theorem [2| and of Corollary O For morphic numbers, let /i > 1 be a Perron num- 
ber, and let 6 > 2 be an integer. Let w be as in the proof of Theorem [5l and let {n.j}j>o = '■ 
vjn = 1}. We associate with w the real number 

i>o 

Since limsupj_>oo nj+i/nj = ^u, we get that 

Vbii^) = M - 1. 

Also, if ^ > 2 then belongs to class C, and so /i(Cw) = This follows from the fact that 
nj+i > 2nj for j sufficiently large, and this enables us to use the Folding Lemma as in [9l S] to 
construct the continued fraction expansion of a rational translate of . 

Now consider automatic numbers. For v = 1, we can choose any morphic binary word that does 
not contain unbounded 0-blocks or 1-blocks (e.g., the Thue-Morse word) to be the base-6 expansion 
of ^, where 6 > 2 is any integer, and get that = = f — 1. Suppose v = p/q > 1. We define a 
binary infinite word u = uoUiU2 • • • by letting n„ = 1 if and only if n belongs to the set 

U{(p)/,(p+i)/,...,M/}. 

h>0 

Then u is p-automatic, because its p-kernel contains only two sequences, namely the sequence 
0"^ and the sequence qpiip-p+^Q'^ (see [21 Theorem 6.6.2]). Let {nj}j>o = {n : n„ = 1}. Since 
{p + l)/p < p/q, we have 

Uj+i p.ph+l p 

lim sup — — = lim r- = - . 

j^oo Uj h^oo qp-p^ q 

Consequently, the real number 

i>i 

satisfies 

VbiU = ^--l. 

q 

□ 

Remark 1 . We stress that, with the above construction for automatic numbers, we do not know 
the value of A*(Cu)) because the condition nj+i/nj > 2 for all j sufficiently large is not satisfied. 
This is not the case with the slightly more complicated construction given in [3] , which works under 
the assumption that p/q exceeds 2. 
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Remark 2. For morphic numbers, we can go a bit further. Keep the notation of the proof of 
Theorem [5l Let F = {a, /3, 7} U S. Let a and b be non-negative integers. Define a morphism 
g -.T* ^T* by g{a) = a/30(70)'^/30(70)^ g{P) = /?, 5(7) = 7 and g{i) = h{i) for i G S. Let 
u = g'^{a). It is an easy induction to show that 

u = a • /? • • (7 • 0)'^ • /? • • (7 • 0)^ • /3 • /i(0) • (7 • /i(0))" • (3 ■ /i(0) • (7 • /i(0))^ • /3 • • • . 

Now let r : F* — > {0, 1}* be the coding that maps /? to 1 and all other letters to 0, and let w = t(u). 
With suitable choices for a and 6, the value of the exponent f ^ at the corresponding morphic number 
is neither rational, nor a Perron number minus 1. However, we do not know whether every positive 
algebraic number can be attained. 
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